Statistics-Driven Localization of Dissimilarities in Data

Authors

  • Alexey Karimov
  • Gabriel Mistelbauer
  • Thomas Auzinger
  • Eduard Gröller

Abstract

The identification of dissimilar regions in spatial and temporal data is a fundamental part of data exploration. It arises in applications such as biomedical image processing and climatic data analysis. We propose a general solution for this task by employing well-founded statistical tools. From a large set of candidate regions, we derive an empirical distribution of the data and perform statistical hypothesis testing to obtain p-values as measures of dissimilarity. With these p-values, we quantify differences and rank regions on a global scale according to their dissimilarity to user-specified exemplar regions. We demonstrate our approach and its generality with two application scenarios, namely interactive exploration of climatic data and segmentation editing in the medical domain. In both cases, our data exploration protocol unifies the interactive data analysis, guiding the user towards regions with the most relevant dissimilarity characteristics. The dissimilarity analysis results are conveyed with a radial tree, which spares the user an exhaustive search through all the data.
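The ranking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not state the test statistic or how candidate regions are generated, so the sketch assumes a 1D signal, sliding windows as candidate regions, and the absolute difference of region means as the statistic. The function names `region_means` and `empirical_p_values` are hypothetical.

```python
from statistics import mean


def region_means(signal, width):
    """Mean of every sliding window of the given width (the candidate regions)."""
    return [mean(signal[i:i + width]) for i in range(len(signal) - width + 1)]


def empirical_p_values(signal, exemplar_start, width):
    """Rank candidate regions by an empirical p-value relative to an exemplar.

    Assumed statistic (not taken from the paper): the absolute difference
    between a region's mean and the exemplar region's mean.
    The p-value of a region is the fraction of all candidate regions whose
    statistic is at least as large, so a small p-value marks a region that
    is unusually dissimilar to the exemplar.
    """
    means = region_means(signal, width)
    exemplar = means[exemplar_start]
    distances = [abs(m - exemplar) for m in means]
    n = len(distances)
    # Empirical tail probability of each distance within all candidates.
    p_values = [sum(other >= d for other in distances) / n for d in distances]
    # Most dissimilar regions first (smallest p-value).
    ranking = sorted(range(n), key=lambda i: p_values[i])
    return p_values, ranking


# Usage: a flat signal with one raised segment; the exemplar is the first window.
signal = [0] * 20 + [10] * 5 + [0] * 20
p_values, ranking = empirical_p_values(signal, exemplar_start=0, width=5)
print(ranking[0])  # index of the window that differs most from the exemplar
```

Because every p-value is computed against the same empirical distribution, the values are comparable across the whole data set, which is what makes a global ranking possible.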





Publication date: 2016